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ABSTRACT 

Represent  a  time-shared  computer  system  as  a  group  of   N   terminals,  each 
having  submission  rate   A   and  exponential   (u)   task  durations,  with  tasks  sub- 
mitted to  a  central  (single)  processor.   There  they  are  serviced  in  processor- 
sharing  or  time-sliced  mode.   It  is  shown  that  the  R(t),   the  response  time 
conditional  on   t  ,  the  required  processing  time,  becomes  approximately  normally 
distributed  as   t   increases.   Similar  results  are.  derived  when  N  increases. 

Variations  of  the  model  consider  control:   an  "inside,"  processor-shared 
queue  services  at  most   c   tasks,  others  queueing  first-come  first-served 
"outside."   Other  possibilities  are  described  and  analyzed. 

1.   Introduction 

The  abstraction  of  computer  capacity  allocation  known  as  processor  sharing 
is  an  attractive  simplification  of  time  slicing,  sometimes  called  round-robin 
scheduling.   The  idea  is  well  known  (Kleinrock,  1976) :   given  that   j   jobs  or 
programs  are  at  the  execution  stage,  each  receives  service  equal  to  one-j_th  of  a 
time  unit  per  time  unit.   In  other  words,  if  the  chance  that  any  single  job, 
processed  alone,  finishes  in   (t,t+h)   is  uh  +  o(h),  (exponential-Markov 
service),  then  the  chance  that  a  particular  ("tagged")  job  in  the  company  of 
(j-1)   others  finishes  in   (t,t+h)   is   y(h/j)+o(h)   as   h  -»■  0.   Processor 
sharing  of  the  above  type  tends  to  be  equitable  in  that  it  permits  short  jobs 
access  to  processing  even  if  they  arrive  after,  and  queue  with,  longer  jobs. 

Apparently  the  first  study  of  delays  to  arriving  and  queueing  jobs  under 
processor  sharing  was  conducted  by  Coffman,  Muntz,  and  Trotter  (1970).   They 
assumed  a  steady  state  M/M/l   system  with  processor  sharing,  and  were  able  to 
determine  properties  of  the  response  time,   R  ,  given  the  processing  time 
required  by  the  arriving  job.   Other  papers  have  also  appeared. 


A  recent  paper  by  D.  Mitra  (1981)  analyzes  response  time,   R  ,  under  the 
assumption  of  a  closed  system.   Idealize  the  behavior  of  a  system  of   N   termi- 
nals and  a  single  computer  as  a  classical  machine-repair  situation:   each  think- 
ing terminal  (failure-prone  machine)  applies  for  computer  service  at  rate  X    , 
and  queued  or  waiting  jobs  are  served  at  rate  u   as  long  as  any  jobs  are 
present.   Markov  assumptions  are  made  throughout,  so  X(t)  ,  the  number  of  jobs 
at  the  service  stage,  is  a  birth  and  death  process  with  transition  rates 

X(t)  =  j  +   X(t+h)  -  j  +  1  :   X.h  +  o(h)  (1.1) 

->  X(t+h)  =  j  -  1  :   u  h  +  o(h) 

-*■  X(t+h)  =  j     :  1  -  (X.+u.)h  +  o(h) 

and  in  particular   A.  =  A(N-j),   u.  =  y   for   j  >^  1  ,  otherwise  being  zero.   Let 
processor  sharing  govern  service  effort  allocation.   In  Mitra  (1981)  the  dis- 
tribution of  response  time  is  characterized,  and  the  moments  (e.g.  mean  and 
variance)  are  found  under  interesting  conditions,  such  as  that  the  tagged  job 
requiring   t   time  units  of  processing  arrives  to  find  j  -  1  accompanying  jobs; 
the  conditional  response  time,  given  only  processing  requirement   t  ,  is  given 
particular  attention. 

In  this  paper  the  previous  analysis  is  generalized  and  extended.   We  intro- 
duce the  idea  of  processor  sharing  in  an  arbitrary  birth  and  death  process  en- 
vironment, thus  allowing  quite  general  terminal-computer  interactions  to  be 
represented.   In  the  process,  the  meaning  of  "system  state  at  the  moment  of  tag- 
ged job  arrival"  is  clarified;  see  also  recent  work  of  Lavenberg  and  Reiser 
(1981).   Response  time  characteristics  are  computed  under  the  assumption  that 
processor-sharing  service  rates  are  processor-state-dependent  in  a  more  general 
way  than  that  described  earlier;  this  allows  for  approximate  representation  of 
overhead  penalties  and  also  of  job  scheduling.   Other  characteristics  of  tagged 
job  response  are  also  studied,  e.g.  the  accumulated  processing  work,   W(x)  , 
actually  performed  on  that  job  by  elapsed  time  t(t  <  t  =  required  processing 
time)   following  job  introduction;  note  that  W(R)  =  t  ,  so  the  first  passage  of 
W(t)   to   t   is  actually  the  response  time. 

Although  differential  equations  may  be  obtained  for  transforms  of  W(t) 
under  various  initial  conditions,  and  hence,  implicitly,  for  its  distribution, 
the  results  are  far  from  being  explicit  and  informative.   However,  central  limit 
theorems  for  additive  functionals  of  Markov  processes,  or  for  cumulative  pro- 
cesses, allow  the  conclusion  that  the  accumulated  work  accomplished  by  fixed 
time  t   on  a  "long"  job  is  approximately  normally  distributed  (Gaussian) .   This 
fact  in  turn  allows  the  conclusion  that  the  response  time  for  a  "long"  job  is 
also  approximately  normally  distributed.   Additionally,  the  normal  approximation 
may  be  shown  to  be  valid  for  our  simple  model — and  probably  for  others  as  well — 
when  the  number  of  competing  terminals  becomes  large,  i.e.  under  heavy  traffic 
conditions.   The  quality  of  the  normal  approximations  for  finite  job  lengths  is 
currently  being  assessed  by  simulation  methods. 

In  the  latter  part  of  this  paper  we  describe  queue  control  in  a  processor- 
sharing  environment.   The  expedient  is  to  limit  the  total  number  of  jobs  allowed 
simultaneous  processor-shared  service  at  an  "inside"  queue,  with  any  excess  in 
"first-come,  first-served"  status  in  an  "outside"  queue.   Long  jobs  are  also 
shifted  from  inside  to  outside  by  a  sampling  mechanism.   It  is  shown  that  long 
jobs  are  favored  by  a  small  inside  span,   c   (c  being  the  number  simultaneously 
processor-shared),  while  short  jobs  are  favored  by  large  c  . 


2.   Mean  Response  Times 

Begin  by  describing  differential  equations  for  the  mean  response  time  to  be 
experienced  by  a  tagged,  particular,  arriving  job.   Other  moments  satisfy  very 
similar  equations. 

(a)   Conditioning  on  Required  Time  and  System  State. 

Throughout  what  follows  Markovian  assumptions  are  made:   service  times 
at  the  computer  are  independent  and  exponential  (u) .   Generalizations  to  phase- 
type  distributions  are  apparently  possible. 

Let  R  refer  to  the  response  time  of  a  newly  arrived  job,  and 

m^t)  =  E[R|X(0)  =  j,  W(R)  =  t]  ,  (2.1) 

the  conditional  expectation  of  response  time,  given  that  the  job  is  initially  in 
the  company  of  j   others  (arrives  to  find  j  -  1  present)  and  requires  "work" 
or  processing  time  equal  to   t  .   Let   X.h   (resp.   u.h)   for  small   h   identify   . 
the  infinitesimal  generator  of  the  accompanying  process,  so  transition  rates  are 
as  in  (1.1). 

Consider  the  possible  system  changes  in   (0,h),   and  subsequently;  the 

(~  r  ( i ) 

let   u.  =  u(j-l)  — . 


m  (t)  =  h  +  m.(t  -  ^^A  •  h)[l  -  (A  +y  )h] 

+  X.hm^  (t  -  ^A  •  h)  +  *y  .hm.  .  (t-  ^A  •  h)  +  o(h)  . 
j   J+l       J  J   J-l      J 


(2.2) 


The  term  r(j)   represents  the  fraction  of  time  the  processor  actually  spends 
processing  when  there  are  j   jobs  being  processed. 

Allowing  h  •*  0  one  finds  the  differential  equations 

■^-m.'(t)  -  1  -  (A,  +  y.)m.(t)  +  A.m.  ,.  (t)  +  y  m   (t)  .      (2.3) 
j    J  3  3      3  3   3+i       3    J--1- 

This  is  a  standard  system  of  linear  differential  equations  with  constant  co- 
efficients; initial  conditions  are  m.(0)  =  0   for  all  j  . 

(b)   Conditioning  on  Required  Time. 

If  one  removes  the  condition  that   X(0)  =  j   in  accordance  with  the 
stationary  distribution  appropriate  for  an  arriving  job  it  follows  that  the 
expected  response  time  is  linear  in  the  required  processing  time,   t  .   This 
holds  for  quite  general  birth-and-death  process  models,  and  not  just  for  the 
simple  machine-repair  setup;  see  Cohen  [1979].   Here  is  the  derivation,  in 
outline. 

First,  observe  that  the  long-run  distribution  of  X(0)  ,  the  number  of  jobs 
present  just  after  the  tagged  job  enters,  is 


q.  -  c*j..x*1-l  "  en  pr(J)      j  -  1,2, ...,N  ,  (2. A) 

where  c   is  selected  so  that  the  q.,'3  3um  to  one.   Recall  that 

XXX  ■* 

0  1**  *    1-1 

».   ■  Tfrt  J Is   the   stationary   distribution    (assumed    to  exist)    of    the 

J    0  m1p2...m 

Markov  chain  X(t)   defined  by  (1.1)  with  p   =  pr(j)  .   This  Is  intuitively 
apparent,  but   a  formal  proof  can  be  based  either  upon  an  embedded  Markov  chain 
formulation,  or  upon  the  theory  of  additive  functionals  of  a  Markov  process; 
see  Cinlar  ((1975),  pp.  269-271).   The  distribution   {q.}   has  also  been  given 

by  Kelly,  (1979),  p.  12. 

Use  (2. A)  to  remove  the  condition  that  X(0)  =  j;   put 


IN 

m(t)  =  EY,mE[R|x(0),  W(R)  =  t]  =  J  q,m.(t)  .  (2.5) 

Then  in  terms  of  the  differential  equations  (2.3);  after  multiplying  through  by 
j/r(j)   one  obtains   (m«(t)  =   0) 

N  N 

m'(t)  =  £  Hi)   qJ  +  &  TO"1'  W*J(t)  +  Wi(t)  +  Vj-i(t)1 

■  j[  «r>  qj  •  (2-6) 

Thus  it  follows  that  the  long-run  conditional  expected  response  time  is  linear 
in  the  processing  time  requirement: 


E[R|W(R)  =  t]  =  t  I     -±rr   q   =  tE 

1=1   U;   J 


X(0) 


r(X(0)) 


(2.7) 


Apparently  no  such  simple  form  exists  for   Var[R[w(R)  =  t]  ,  although 
Mitra  (1981)  has  given  a  formula  for  a  particular  case.   It  will  be  seen, 
however,  that  the  above  variance  is  indeed  proportional  to   t  if   t  is  large. 

3.   Total  Work  Completed  on  a  Tagged  Job  in  a  Fixed  Time 

Turn  attention  now  to  W(t)  ,  the  total  work  expended  by  the  computer  on 
the  tagged  job  by  time  T   after  its  arrival,  given  that  the  tagged  job  requires 
exactly   t   time-units  of  work  for  completion.   If  when  the  job  arrives  there 
are  X(0)  =  j   customers  present,  then 


W(t)  -  /  r(j[((^)  du,        X(0)  =j>l.  (3.1) 


(a)   The  Laplace  transform  of  W(t) . 

Here  is  the  derivation  of  a  differential  equation  for  the  Laplace 
transform  of  W(t) : 


4>  (s,Tjt)  =  E[e  '  u'|x(0)  =  j,  R  =  t]  ,     for    j  >  1         (3.2) 


Now  argue  in  a  manner  analogous  to  the  discussion  prior  to  (2.2)  to  write  a 
backward  equation:   for  0  <  h  <  t 

<f>   (s,T;t)   -   (e~Sr(j)h/j)[l-U .+  y ,)h]4» .(s,T-h;t) 

J  J  J  J 

+    (e"Sr(j)h/j)U|j+1    (s,x-h;t)  (3.3) 


+   (e   Sr(j)h/j)Mjh^_1    (s,T-h;t)    +   o(h)     , 


leading  to 


d<J>.  ^  ^ 

-^  =  -    ttj    +  Rj    +  sr(j)/j)^    +  Aj(j,j+1  +  yj4)j_1    .  (3.4) 


Initial  conditions  are 


^(s.Ojt)  =  E[e"sW(0)|x(0)  =  j,  R  =  t]  -  1  ,    t  >  0  ,        (3.5) 

since  initially  W(0)  =  0  ,  regardless  of  the  job  requirements  or  the  initial 
environment. 

(b)   A  central  limit  theorem  for  W(t) . 

Examination  of  (3.1)  shows  that  W(t)   involves  sums  of  contributions 
to  work  accumulated  while  the  system  inhabits  various  states  during  the  period 
(0,t).   This  suggests  that,  at  least  for  "long"  jobs,  i.e.  such  that  required 
processing  time   t  -+  «°  ,  one  can  anticipate  a  nearly-Normal  distribution  for 
W(t)  .   An  appropriate  central  limit  theorem  that  establishes  this  for  finite 
birth-and-death  models  can  be  found  in  Keilson  ((1979),  p.  121);  call  this 
Theorem  K.   Alternatively,  one  can  make  use  of  the  theory  of  cumulative  pro- 
cesses, see  Cox  ((1962),  pp.  99-101);  the  latter  development  is  adaptable  to 
models  more  general  than  the  simple  birth-and-death  process. 


In  order  to  apply  Theorem  K  redefine  the  infinitesimal  generator  (1.1)  to 
describe  the  behavior  of  the  accompaniment ,  X' (t)  ,  of  the  tagged  customer;  note 
that  the  relevant  generator  is  now 


X'(t)    -  j   -*■  X'(t+h)    =   j   +  1    :        A'h  +  o(h) 


Xj+lh  +  o(h) 


(3.6) 


->  X'(t+h)    =  j    -   1 


u!h  +  o(h) 


+  X'  (t+h)    =   j 


=  y 


J+lU+i 


At  h  +  o(h) 


l-(Aj   +  uj)h  +  o(h) 


(3.7) 
(3.8) 


for      j   =  0,1,2,...,      N'    =  N  -   1    .      Then 


W(t)   =   /     f (X' (u))du 
0 


_    rT   r(X'  (u)+l)du 


-=/ 


X»(u)    +   1 


(3.9) 


and   theorem     K     states   that 


W(T)     -     £T 

o/F 


(D) 


N(0,1)    ; 


(3.10) 


the  constants  E,      and  a        are  such  that 


N1 

C  =  I     f(J)ir] 
j=0      J 


(3.11) 


,2  _ 


oz  =  2[f(0),f(l),...,f(N')] 


V 


f(0) 

& 

f(l) 

• 

• 

_f(N')   g 

(3.12) 


where  in  the  present  case  the  definition  of   f(»)   is  implicit  in  (3.9),  and 
£  is  the  matrix 


£  =  Y{[I"4+JU  *  ~  k} 


(3.13) 


£     being  the  identity,  and 


0 
'0 


V 


v 


V 


(3.14) 


Jy-rows  are  steady-state  probabilities  for  the  accompaniment,  and  A,  is  defined 
as  follows: 


Ao,i  =  xo/Y  •   Ao,o  =  x  -  xo/y 


aj,j+i  =  xj/y  •  Aj,j-i  =  yj/Y  •  Ai,j  =  i-y* 


(3.15) 


v .  =  A !  +  y !  ;    Y  =  max  v .  . 
1  3  J  j    J 


(c)   A  central  limit  theorem  for  response  time,   R(t) . 

A  graph  of  W(x)   vs.   x   starts  with  W(0)  =  0  and  increases  in 
random  straight-line  segments  until  W(x)  =  t.   The  value  of   x   at  which  this 
occurs,   x(t)  =  R(t),   is  the  first-passage  time  to   t   of  the  work  process 
(W(x) ,  x>0}  ,  and  is  the  required  response  time,  so 


P{W(x)<t}  =  P{R(t)>x} 


(3.16) 


Now  invoke  the  previous  theorem  (3.10)  concerning  asymptotic  normality  of  W(x) 
and  a  standard  argument  of  renewal  theory,  cf.  Karlin  and  Taylor  ((1979), 
pp.  208-209)  to  see  that  if   t  =  £x  +  /azxx  ,  then,  as   t  -*■  °°  , 


t    /ozt 


from  which  it  follows  that 


R(t)  -  at   R(t)  -  t/g 


(D) 


/$' 


/■ 


& 


N(0,1) 


(3.17) 


ft  ((I  i  '••  l-'K'" 

4.   Heavy  Traffic  Analysis  of  the  Response  Time  of  a  Processor-Shared  Job 

This  section  investigates  the  problem  of  delay  of  a  tagged  job  requiring 

t  units  of  processing  time  when  it  is  accompanied  by  many  others,  i.e.  is  in  a 

heavily  loaded  system.   Restrict  attention  to  the  machine  repair  model  in  which 

A.  »  X(N-j)   and  u .  ■  u  ,  and  omit  the  effect  of   r(j),  i.e.   r(j)  5  1  .   Let 

—1 
there  be  N  terminals  and  one  processor,  with  A    being  the  expected  terminal 

think  time  (exponentially  distributed),   u  =  Nu '   being  the  processing  rate  of 

arriving  jobs;   A  and  u'   are  fixed  but  N  is  large  and  the  service  rate 

scaling  by  N  is  required  in  order  that  queue  size  be  of  order   N  . 

Now  utilize  the  fact  (Iglehart  (1965),  and  Gaver  and  Lehoczky  (1976))  that 
if   X(t)   is  the  number  of  jobs  at  the  processing  stage  at   t   then  X(t)   can 
be  approximated  by  a  diffusion  process: 

X(t)  =  N  a(t)  +  Jfi  Y(t)  (4.1) 

where  a(t)   is  a  deterministic  function  of  time  and   {y(t)}   is,  for  the 
present  model,  a  particular  Ornstein-Uhlenbeck  process.   It  turns  out  that  when 

N  ■+  °° 


4|^=  A(l-a(t))  -  u'  (4.2) 

or 

a(~)  =  1  "X  (A. 3) 


which  is  feasible  if  A  >  u1  ,  i.e.  under  heavy  traffic  conditions.   Furthermore 

dY(t)  =  -  AY(t)dt  +  A(l-a(t))  +  u'  dB(t)  ,  (4.4) 

{B(t),t  >_  0}  being  the  standard  Wiener  process.   In  the  long  run, 

dy(t)  =  -  Ay(t)dt  +  v;Vr  dB(t)  .  (4.5) 

It  is  in  the  environment   X(t)   described  by  (4.1)  that  the  tagged  job  enters. 

It  encounters  competition  for  processor— shared  service,  and  so  its  accumulated 
work  completed  by  fixed  time  x   is  essentially 


WW  -  /  -x&  .  (4.6) 


Apply  the  approximation  and  expand  to  second  order  terms  in  N  ,  the  number  of 
terminals,  to  find 


W<T)  *  /T  .     d"   Y(u)  . 

T          . 
r          du 

""  }Q     Na(u) 

rT    *4T  Y  (u)du 
0        (Na(u))2 

L        a(u)/N 

(A. 7) 


For  simplicity,  and  to  enable  comparisons  with  previous  results,  let 

a(v)  ■  a(°°)  ,  so  the  tagged  job  arrives  in  the  steady  state.   Expression  (A. 6) 

then  says  that  for  the  approximation  advanced  here, 


W<TW  EU^JT"  (E[X(-)])*  /  Y(u)dU'   °±TiC  <*•») 


so  the  expected  amount  of  work  done  on  the  tagged  job  is  nearly  t/E[X(«)]  f 
and  the  actual  distribution  of  total  work  done  is  approximately  Gaussian 
(integral  of  an  Ornstein-Uhlenbeck  process) ,  where  the  Gaussian  property  results 
from  the  assumption  of  many  accompanying  jobs,  and  not  necessarily  because  the 
tagged  job  is  long. 

Standard  calculations  applied  to  (A.  7)  show  that,  as  t  ■+   °°  , 

T  T 

E(/  Y(u)du]  ^  0  and  Var[/  y(u)du]  =*  (2u'/A2)t,  so  the  normal  approximation  to 

0  0 

accumulated  work  W(t)   has  the  parameters 


lNa(~)]  '     "     Az    lNa(»)r    A^NaC-)]4 


from  which  it  follows  that  the  parameters  of  the  normal  approximation  to  R(t) 
are 


a  -  j  ■  [Na(»)]  c-  E[x(~)] 

(A. 10) 

«2  .  2p   1   _  2y    1    m   2  ,N  -  E[X(~)K 
P    A7  Na(«)  ~  I7  E[X(oo)]   X1  E[X(~)j   ;* 


These  formulas  state  that  if  think  (demand)  rate  X   is  very  large  then,  since 
E{X(»)]  ■*■  N  ,  the  variance  of  response  time  diminishes,  while  of  course  expected 
response  time  increases  like  N  .   This  is  plausible  since  in  extremely  heavy 

traffic  all  terminals  compete,  and  the  tagged  job  gets  a  steady  (1/N) —  of  a 
quantum.  For  smaller  X  the  expected  response  time  drops  with  E[x(°°)J  ,  but 
response  time  variance  increases. 

The  above  derivations  are  informative  but  not  rigorous.   Semigroup  methods 
of  Berman  (1979)  can  be  applied  to  place  the  results  on  a  mathematically  solid 
basis.   Numerical  assessment  of  the  results  is  also  of  interest. 


' 5.   Numerical  Comparisons 

In  this  section  a  brief  investigation  is  reported  of  the  numerical  agree- 
ment between  the  very  simple  formulas  from  heavy  traffic  theory  for  the  param- 
eters of  the  accumulated  work  distribution  and  those  of  direct  Markov-chain 
cumulative  process  theory  origin. 

Parameters  of  Total  Work 

Examples.   Let  u  -  1,   N  -  25,  50,  with  A  varying. 


N  -  25 


Rates(A) 


0.01 


0.0222.. . 


0.05 


0.10 


0.15 


£(M.Ch.) 
(Diffus.) 
c2(M.Ch.) 
(Diffus.) 


0.77 


0.44 


0.51 


0.75 


0.16 

0.067 

.055 

0.20 

0.067 

0.055 

0.22 

.0045 

0.00085 

1.28 

.0040 

0.00079 

N  «=  50 


Rates  (X) : 


0.01 


0.0222... 


0.05 


0*10 


0.15 


£(M.Ch.) 
(Diffus . ) 
o2(M.Ch.) 
(Diffus.) 


0.53 


0.83 


0.13 

0.033 

0.025 

0.023 

0.20 

0.033 

0.025 

0.023 

0.33 

0.0011 

0.000081 

0.000026 

6.48 

0.00099 

0.000078 

0.000025 

The  diffusion  approximation  and  Markov  chain  parameters  agree  remarkably 
well  when  traffic  is  heavy  (large  A),  but,  as  might  be  feared,  diffusion  fails 
miserably  for  small  A  . 


6.   Queue  Control  by  Service  Span  and  Interruption 

In  this  section  we  consider  queue  control.   The  central  processor  now  has 
finite  service  span,   c  ,  which  may  be  smaller  than  the  number,   N  ,  of  ter- 
minals.  This  means  that  if  there  are  i  _<  c  jobs  in  service  they  are  served 
as  before  "inside"  at  a  rate  ur(i)  ,  with  processor  sharing  in  effect.   How- 
ever, if  there  are  more  than  c   jobs  simultaneously  requesting  service,  only 
c  of  them  are  served  simultaneously,  and  at  rate  ur(c)  ,  also  with  processor 
sharing  discipline.   The  others  are  queued  "outside",  with  "first-come,  first- 
served"  service  discipline. 
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If  there  are  more  than  c   customers  requesting  service,  the  customers  that 
are  in  service  "inside"  experience  independent  service  interruptions  at  rate   v. 
When  service  is  interrupted,  each  job  in  service  is  equally  likely  to  be  moved 
to  the  end  of  the  queue;  thereupon  the  job  at  the  head  of  the  "outside"  queue 
immediately  enters  service.   Both  the  imposition  of  the  limited  processor 
sharing,  imposed  by   c  <_   N  ,  and  the  interruption  process  are  intended  to  control 
queueing  by  adjusting  the  relative  attention  given  to  short  and  long  jobs. 

Markovian  assumptions  are  made  throughout,  so  that   X(t)  ,  the  number  of 
jobs  requesting  service  at  time   t  ,  is  a  birth  and  death  process  with  transi- 
tion rates  given  by  (1.1). 

(a)   An  Auxiliary  Process. 

Let  R  be  the  response  time  of  a  newly  arrived  tagged  job.   Since  the 
tagged  job  may  not  be  served  until  completion  when  it  first  enters  service,  it 
is  necessary  to  introduce  an  auxiliary  process   {Y(t);  t  >_   0}   in  order  to  study 
R  . 

In  brief  summary,  if  there  are  i   customers  (including  the  tagged  job)  re- 
questing service  at  time   t   and  the  tagged  job  is  in  service,  then  the  state  of 
Y(t)   is   (i,0)  .   If  there  are   i  >  c  customers  requesting  service  at  time   t 
and  the  tagged  job  is  not  in  service  but  is  in  the  jth  position  in  queue,  the 
state  of  Y(t)   is   (i,j)  .   One  can  now  describe  the  possible  changes  in  Y(t) 
in  a  time  interval  of  length  h  ;  details  will  be  presented  elsewhere. 

Let 

m(i    (t)  =  EIR  |Y(0)  =  (i,j)  ,    W(R)  =  t]  ,  (6.1) 

the  conditional  expected  response  time,  given  that  the  tagged  job  is  initially 
in  the  company  of   (i-1)   others  and  either  it  is  being  served  inside 
(if  j  =  0)   or  it  is  jth  in  the  outside  queue  (if  j  >  0)  . 

Arguments  similar  to  those  of  Section  2  yield  differential  equations  for 

m,      ,x(t)   which  can  be  solved  numerically.   A  closed-form  solution  is  compli- 

v.i»  J  ) 
cated  and  uninformative.   It  is  possible  to  numerically  evaluate  the  mean 

response  time, 

m(t)  =  E[R|W(R)  =  t]  =  I     q(i>j)m(ij-j)(t)  > 

where  q,.  ..   is  the  initial  distribution  encountered  by  the  tagged  job. 
\*-»  J  ) 

The  mean  response  time  is  not  generally  linear  in   t   for  this  model.   Note 
that  if   c  =  N   then  this  model  is  equivalent  to  that  considered  in  Section  2 
and  hence  as  shown  is  Section  2,   m(t)   is.  linear  in  t  for  that  special  case. 

(b)   An  Approximation  to  Expected  Response  Time. 

A  useful  approximation  to  the  expected  response  time  for  a  job  requir- 
ing  t  units  of  work  is  obtained  by  the  following  argument.   Assume  that  the 
service  rate  for  the  tagged  job  is  the  same  throughout  its  processing  and  is 
equal  to  the  rate  that  it  experiences  when  it  first  enters  the  processor.   Thus 
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li  It).  l\ 


the  tagged  job  requires   t* 


it 
r(i) 


units  of  processing  time  if  it  enters  when 


these  are  i  <^  c  Jobs  (including  the  tagged  one)  requesting  processing.   If 

ct 
i  >  c  ,  then   t*  »  —( — r  .   If  i  >  c  ,  then  the  number  of  service  interruptions 

v 
during   t   is  Poisson  with  rate  —  .   Each  time  service  is  interrupted,  the 

c  _1 

tagged  job  spends  an  expected  amount  of  time   (i-c)  [v  +  ur(c)J    in  queue. 

Thus  the  expected  time  spent  in  queue  because  of  service  interuptions  is 

(i-c) (— )t*[v  +  ur(c)]    .   If   i  >  c  ,  the  expected  initial  wait  in  queue  until 

C  -1 

the  tagged  job  starts  service  is   (i-c) [v  +  ur(c)]    .   The  resulting  approxi- 
mation to  the  expected  response  time  is  (see  (2. A)) 


c.N 

V      lt   .    V       ct 

=  k  qi  *BT  +  Jc+1  qi  *3 


(6.2) 


I        q.(i-c)[v  +  ur(c)] 
i=c+l 


-1 


vt 


r(c) 


+  1 


Table  1  gives  values  for  the  expected  response  time,   m(t)  ,  and  the  above 
approximation  for  various  values  of   X,  u,  v,  c,   and   t   for  r(j)  5  1   and 
N  =  25  terminals.   The  quality  of  the  approximation  (6.2)  appears  to  be 
excellent  for  all  cases  considered. 

(c)   Numerical  Implications. 

Aspects  of  the  behavior  of  m(t)   to  be  noted  from  the  table  are  as 
follows.   If  the  amount  of  processing  time  required,   t  ,  is  "small",  then 
expected  response  time  is  minimized  when  c   is  maximized  (here  c  =  25) ;  that 
is,  when  there  is  maximal  processor  sharing  and  no  outside  queue.   If   t   is 
"large",  then  expected  response  time  is  minimized  when  c  =  1;   that  is,  when 
the  processor  is  dedicated  solely  to  the  job  that  is  being  served,  and  other 
jobs  queue  outside  in  turn.   Note  that  increasing  the  rate  of  service  inter- 
ruptions by  changing  v   can  either  increase  or  decrease  the  expected  response 
time,  depending  upon  job  time  requirements. 

These  behavioral  aspects  also  appear  by  taking  derivatives  of  the  approxi- 
mate average  response  time  A  .   In  particular, 


3v 


A  \ 


<  0 

if 

=   0 

if 

>  0 

if 

'<*. 
'-*. 


t  > 


If  r(j)  El    j  =  1,...,N, 

1 
increasing  in  c  if   t  >  —  , 


then  A  is  decreasing  in  c  if 
and  A  is  constant  in  c  if 


t  < 


1 
t  =  — 

y 


A  is 


12 


Finally,  arguments  similar  to  those  in  Section  3  will  show  that  the  re- 
sponse time  is  approximately  normally  distributed  when  the  required  work  is 
large.   Again,  details  will  be  provided  in  later  work. 
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APPENDIX 


To  show  (2.7)  and  (3.10)  yield  the  same  value  for  E([R|w(t)  =  t]   for   t 
large  for  the  case  u,  =  ur(j)  . 


First: 


q(j)   - 


x    X 


1=1 


Vl   X 


>1 


Hence, 


■  ■  I 


q. 


w r(J)  MJ 


00  •  X_     X     •  •  •     x    X.     -t 

=  c  y  -J 5 3=k 

j-l r(j)  yl x  ••• x  Vl 


Since, 


y.  -  yr(j) 


m  =  cy 


-  .  a0  *  ...  x  a.^) 

j=l  ^   x    ...    x   Mj 


cy 


n  u. 


£      j(XQ    x    ...    x    \         ) 


k=l 


n      u 
i-j+l 


Further, 


c  = 


n  y. 

k-1 1 


J      X      x   ...    x   x  n  y 

1-1  J         k=j 


Hence, 


m  =  y 


*    j(Ao 

J=1 


x    .  . 


•  x  Yi} 


n      m 
lk=j+l     kJ 


1-1 


'J-1 

n    x. 
i-i    x 


I    <Ao  x  • 
j-i 


..     KX  ) 


lk=J  J 


j-l 

n    x 

i=l 


n      y, 


k=j+l 


11         ^ 


where  ^ixXn  =  1  by  convention. 
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Second: 


ir+(j) 


M2    x    ...    x    „ 


(j+l)TT»(0)  j     =     0,1,... 


tX1    x    XQ    E    1] 


where 


tt'CO) 


n    y, 
k=2 


I    j(x-  x  ...  x  x      )      n      u 

j=l  J  k=j+l     k 


r(j+l)      , 


r(j) 


j=0     J+1  j=l      J 


°°  X      x    . .  .    x    X 

I     r(j)      t     K    _        .  .I-1  «'W 


j-1  ^2  *    •••   *  wj 


X,       X       ...       X      \ 

1 jdL 


-£    I 


y  j=!     ^   X    •"    X   yj-l 


Tf'(0) 


since 


y.+1  =  yr(j+l)    .      Thus 


tt'(O)        1 


n    y 
k=2     ' 


-      7      A,    x    ...    x    X.    . 

y  ^    1  J-- 


.n.  ^k 


GO 

J         (A      x    ...    x    A 
_  J=l  1 

=     *<— r 


k=J 


I     J(X. 


k=j+l       d 


A  comparison  of   £  with  m  shows  that 


1 
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